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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Tang, Y. Tom 

Corley, Neil C. 
Guegler, Karl J. 

(ii) TITLE OF THE INVENTION: HUMAN SMN-LIKE PROTEIN 



(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Dr. 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0482 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRSTNOT24 

(B) CLONE: 3769729 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Ser Glu Asp Leu Ala Lys Gin Leu Ala Ser Tyr Lys Ala Gin Leu 

1 5 10 15 

Gin Gin Val Glu Ala Ala Leu Ser Gly Asn Gly Glu Asn Glu Asp Leu 

20 25 30 

Leu Lys Leu Lys Lys Asp Leu Gin Glu Val lie Glu Leu Thr Lys Asp 

35 40 45 

Leu Leu Ser Thr Gin Pro Ser Glu Thr Leu Ala Ser Ser Asp Ser Phe 

50 55 60 

Ala Ser Thr Gin Pro Thr His Ser Trp Lys Val Gly Asp Lys Cys Met 
65 70 75 80 

Ala Val Trp Ser Glu Asp Gly Gin Cys Tyr Glu Ala Glu lie Glu Glu 

85 90 95 

lie Asp Glu Glu Asn Gly Thr Ala Ala lie Thr Phe Ala Gly Tyr Gly 

100 105 110 

Asn Ala Glu Val Thr Pro Leu Leu Asn Leu Lys Pro Val Glu Glu Gly 

115 120 125 

Arg Lys Ala Lys Glu Asp Ser Gly Asn Lys Pro Met Ser Lys Lys Glu 

130 135 140 

Met lie Ala Gin Gin Arg Glu Tyr Lys Lys Lys Lys Ala Leu Lys Lys 
145 150 155 160 

Ala Gin Arg lie Lys Glu Leu Glu Gin Glu Arg Glu Asp Gin Lys Val 

165 170 175 

Lys Trp Gin Gin Phe Asn Asn Arg Ala Tyr Ser Lys Asn Lys Lys Gly 

180 185 190 

Gin Val Lys Arg Ser lie Phe Ala Ser Pro Glu Ser Val Thr Gly Lys 

195 200 205 

Val Gly Val Gly Thr Cys Gly He Ala Asp Lys Pro Met Thr Gin Tyr 

210 215 220 

Gin Asp Thr Ser Lys Tyr Asn Val Arg His Leu Met Pro Gin 
225 230 235 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRSTNOT24 
<B) CLONE: 3769729 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



TCTTTCATAG AGACTAAAGT TATTCAGCAG GCAGCAAAAT AATCTACTTA AGTCCTGCCT 60 

TTC TTTTTTC ACTTAAAAAA GTGGGTGTGA TAATATCCAG GCTAGCTAGC TGACTAGCTC 120 

CCCGGGCAGT CTATGATAAT CAGAGATAGT CAATTTATTA GGCTGTTTTG CTGAATAAGC 180 

TGGTTCTAAA GGAGGCAGGG GTCAAGTCAC TTGTCTCATA TATTACAGTG GCTCTCTGCA 240 

TCCCCGAAAC GCCTTCCTTC AGTAAGCAGA GTGCTTGAGT GCACCCCATT TGACCTGCTG 300 

ATATGTAGAT CACAACNCCT GATGCTTCCT GGAATTGCCG ATTACTGTAA CTGCTGCCCA 3 60 

TCTGTCGATG AAGGAGCAGT TTCAGAACTC AGACTTGAGG GAGGAAAAGT AATTAATGGT 42 0 

GCCCGGCGTT CCTCCCTTCC CCCTCGCCGC CGAC CGAGTT CTTCCTTTTC AGACCGGGTC 480 

GCCTTGCTGT CGTCGCGGTG ATTTTC CTGC TACTGCTACT GCTGCTGCTG CCACCGCCAC 540 

TACCACTGGG CTCATTTGCC CCGACCCCTT CCCGCCGCCC CGCCCCCAGC CCCACACAAG 600 

ATGTCAGAGG ATTTAGCAAA GCAGCTGGCA AGCTACAAAG CTCAGCTCCA GCAAGTTGAA 660 

GCTGCATTAT CTGGAAATGG AGAAAATGAA GATTTGCTAA AATTGAAGAA AGATTTACAA 720 
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GAAGTTATAG AACTAACCAA AGACCTTCTG TCAACTCAAC CTTCTGAGAC GCTTGCAAGT 780 

i TCAGACAGTT TTGCTTCTAC TCAACCTACT CATTCATGGA AAGTAGGAGA CAAGTGTATG 840 

GCAGTCTGGA GTGAAGATGG ACAGTGTTAT GAAGCGGAGA TTGAGGAGAT AGATGAAGAA 900 

AATGGCACCG CTGCAATCAC CTTTGCTGGT TATGGCAATG CTGAAGTGAC TCCACTGTTG 960 

AACCTCAAGC CTGTAGAAGA AGGAAGGAAG GCAAAGGAGG ACAGTGGCAA CAAACCCATG 1020 

TCAAAAAAAG AAATGATTGC CCAGCAGCGT GAATATAAAA AGAAGAAAGC TTTGAAAAAA 1080 

GCTCAGAGAA TAAAAGAACT TGAGCAGGAA AG AG AGG AC C AGAAAGTGAA ATGGCAACAA 1140 

TTCAACAACA GAGCCTATTC TAAAAACAAA AAAGGCCAGG TAAAGAGGAG TATTTTTGCT 1200 

TCACCTGAGA GTGTGACTGG TAAAGTTGGA GTAGGAACCT GTGGAATTGC TGATAAACCT 1260 

ATGACACAAT ATCAAGATAC CTCTAAATAC AATGTCAGGC ATTTGATGCC TCAATAATCA 1320 

GAAAAACTGT TGGATTTCAT CTCTGCAGGG CTTTACATTT ACCTTTTTAT CCTTATATTT 1380 

TTCTAAAGGT AAATTATTTG TTAGATGAGT AAGCAAGATA CCATTGTCGT CATTGGTTGG 1440 

CTTCAGTAGA ATGAAACGTG AAGAAATTGC ATTTGATAAC TGC TATTC AT TTAACTTTTC 1500 

TCATTATCAG TACCACGGTT CCCTCAAAGT TTGTTGAATA AAGCAACTTT TGTAGATGCT 1560 

GTTTCATACA GCACTTAGAT GAATTATTGA TCTTCCTAAT ATCAGGCGCC TACTTAACCT 1620 

ATGGTGTGTA CTTTTTGTAA GTTGTAACTT GAAATTTTCA GATGC TTTGA ACTTGACACA 1680 

TACTCTAGCA ATTCATTGGA ACACCAAGGC AAAAAC AC C A ACCTGCTAAA AGAGATCTTT 1740 

TCATTTTTCT TATTTTCAGC TTTAAAACTT AGCTGTCGTT CAGTTAAGCT TAAAGATAGG 1800 

TTAATTTGTA AATGGCAAAG TTTGTTTTGA GGTTTTTCCT CAATAACTTG TTTCCCAGGC 1860 

CTATTAGGCC ATCTCTAAAA TTGATCTAGC TGTTTTATTT TTATGTACTC TTAGTTTTAT 1920 

GTAAGAAACC TTAGGATGAG CTCCCTTTTC TAAGGTGTTT TTGTTTTTTG TATGTTTGCT 1980 

TTTTTCCTGT TTTTTGTTTT TTCCATTTAC GGCAGTGGTA CCATGTTTTG GATGTGTGAT 2040 

GTTTATATGG GAGAACAAAA AGCTGATGTA TAGCCCTGTA TACAGTGTAG ATACTATTTT 2100 

TGTAAAAACA CAAGGCTAAA TTAATGAACA AG AAT AC TG A ATATTTCATC ATTAAAAATT 2160 

TCTTGTATTT CTTGTGCATT AATCTGACGA TAATTTCCCT GTATATTATG TTCATTTAGC 2220 

TGTTTGTAAT TTTTGTTAAT TAGATCAGGT TGTCTGCATT TGTTGGTGTA AGTGAACATC 2280 

ATCACAGTTA TCCTGAGTTG AGTTTAAGCC AAATACATGC ATAGAAAAGG GTCTTCCTAT 2340 

TAATGGAAGA AGGTAATTTT TAGGATGTGT ATTATTTCAG TTTTGTATGT TTAACTTTTA 2400 

TTAAATAAAG TGTTTTTAAA ATCTCC 2426 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1857114 

(xi) SEQUENCE DESCRIPTION 

Met Ala Met Gly Ser Gly Gly Ala 

1 5 
Leu Phe Arg Arg Gly Thr Gly Gin 
20 

Asp Thr Ala Leu lie Lys Ala Tyr 

35 40 
His Ala Leu Lys Asn Gly Asp lie 

50 55 
Gly Thr Ala Arg Arg Lys Pro Ala 
65 70 
Asn Ala Thr Thr Pro Leu Lys Gin 
85 

Ala Val Trp Ser Glu Asp Gly Cys 
100 

lie Asp Phe Lys Arg Glu Thr Cys 



: SEQ ID NO: 3: 

Gly Ser Glu Gin Glu Asp Thr Val 

10 15 
Ser Asp Asp Ser Asp lie Trp Asp 
25 30 
Asp Lys Ala Val Ala Ser Phe Lys 
45 

Cys Glu Thr Pro Asp Lys Pro Lys 
60 

Lys Lys Asn Lys Ser Gin Lys Lys 

75 80 
Trp Lys Val Gly Asp Lys Cys Ser 

90 95 
lie Tyr Pro Ala Thr lie Thr Ser 
105 110 
Val Val Val Tyr Thr Gly Tyr Gly 
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115 120 125 

Asn Arg Glu Glu Gin Asn Leu Ser Asp Leu Leu Ser Pro Thr Cys Glu 

130 135 140 

Val Ala Asn Ser Thr Glu Gin Asn Thr Gin Glu Asn Glu Ser Gin Val 
145 150 155 160 

Ser Thr Asp Asp Ser Glu His Ser Ser Arg Ser Leu Arg Ser Lys Ala 

165 170 175 

His Ser Lys Ser Lys Ala Ala Pro Trp Thr Ser Phe Leu Pro Pro Pro 

180 185 190 

Pro Pro Met Pro Gly Ser Gly Leu Gly Pro Gly Lys Pro Gly Leu Lys 

195 200 205 

Phe Asn Gly Pro Pro Pro Pro Pro Pro Leu Pro Pro Pro Pro Phe Leu 

210 215 220 

Pro Cys Trp Met Pro Pro Phe Pro Ser Gly Pro Pro lie He Pro Pro 
225 230 235 240 

Pro Pro Pro He Ser Pro Asp Cys Leu Asp Asp Thr Asp Ala Leu Gly 

245 250 255 

Ser Met Leu He Ser Trp Tyr Met Ser Gly Tyr His Thr Gly Tyr Tyr 

260 265 270 

Met Gly Phe Arg Gin Asn Lys Lys Glu Gly Lys Cys Ser His Thr Asn 
275 280 285 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1314346 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ala Met Ser Ser Gly Gly Ser Gly Gly Gly Val Pro Glu Gin Glu 

15 10 15 

Asp Ser Val Leu Phe Arg Arg Gly Thr Gly Gin Ser Asp Asp Ser Asp 

20 25 30 

He Trp Asp Asp Thr Ala Leu He Lys Ala Tyr Asp Lys Ala Val Ala 

35 40 45 

Ser Phe Lys His Ala Leu Lys Asn Gly Asp He Cys Glu Thr Ser Gly 

50 55 60 

Lvs Pro Lys Thr Thr Pro Lys Arg Lys Pro Ala Lys Lys Asn Lys Ser 
65 70 75 80 

Gin Lys Lys Asn Thr Ala Ala Ser Leu Gin Gin Trp Lys Val Gly Asp 

85 90 95 

Lys Cys Ser Ala He Trp Ser Glu Asp Gly Cys He Tyr Pro Ala Thr 

100 105 HO 

He Ala Ser He Asp Phe Lys Arg Glu Thr Cys Val Val Val Tyr Thr 

115 120 125 

Gly Tyr Gly Asn Arg Glu Glu Gin Asn Leu Ser Asp Leu Leu Ser Pro 

130 135 140 

He Cys Glu Val Ala Asn Asn He Glu Gin Asn Ala Gin Glu Asn Glu 
145 150 155 160 

Asn Glu Ser Gin Val Ser Thr Asp Glu Ser Glu Asn Ser Arg Ser Pro 

165 170 175 

Gly Asn Lys Ser Asp Asn He Lys Pro Lys Ser Ala Pro Trp Asn Ser 
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Phe 



180 185 
Leu Pro Pro Pro Pro Pro Met Pro Gly Pro Arg Leu 
195 200 205 



190 
Gly 



Pro Gly 



Lys Pro Gly Leu Lys Phe Asn Gly Pro Pro Pro Pro Pro Pro Pro Pro 

210 215 220 

Pro Pro His Leu Leu Ser Cys Trp Leu Pro Pro Phe Pro Ser Gly Pro 
225 230 235 240 

Pro lie lie Pro Pro Pro Pro Pro lie Cys Pro Asp Ser Leu Asp Asp 

245 250 255 

Ala Asp Ala Leu Gly Ser Met Leu He Ser Trp Tyr Met Ser Gly Tyr 

260 265 270 

His Thr Gly Tyr Tyr Met Gly Phe Arg Gin Asn Gin Lys Glu Gly Arg 

275 280 285 

Cys Ser His Ser Leu Asn 
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